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(57) Abstract ; 

The present invention provides improved methods for manipulating recombinant DNA in gene cloning and expression. 
More specincally, the invention provides methods capable of altering a nucleic acid sequence present at the termmt of a target se- 
quence. 
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TITLE OF THE tNVENTIOy ; 

USE OF EXO-SAMPLE NUCLEOTIDES IN GENE CLONING 

FIELD OF THE INVPpriON: 

The invention relates to improved methods for 
5 manipulating recombinant DNA in gene cloning and 
expression* More specifically; the invention provides a 
method in which exo-sample nucleotides are used to alter 
either the 3 • or 5 ' teiTminus of the nucleic acid sequence 
of a target sequence * 

10 BACKGROUND OF THE INVENTION ; 

Recombinant DNA methodologies capable of amplifying 
purified nucleic acid fragments have long been recognized. 
Typically, such methodologies involve the introduction of 
a desired nucleic acid fragment into a DNA or RNA vector, 

15 the clonal amplification of the vector, and the recovery 
of the amplified nucleic acid fragment* Examples of such 
methodologies are provided by Cohen et al, (U.S, patent 
4,237,224), Maniatis, T. et al. . Molecular Cloning: A 
Laboratory Manual , Cold Spring Harbor Laboratory, 1982, 

2 0 etc. 

In some instances, the desired nucleic acid molecule 
can be readily obtained from a source material. The 
molecule can then be inserted into a suitable vector by 
either adding "linker molecules" (see Scheller et aJL,. , 
25 Science I2g;177-180 (1977)) or by treating the desired 
molecule with a restriction endonuclease* 
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In other instances, however, the desired nucleic acid 
molecule cannot be obtained froin a source material at a 
concentration or in an amount sufficient to permit gene 
cloning. In such a situation, it is necessary to amplify 
5 the nucleic acid molecule by, for example, template-* 
directed extension, prior to introducing it into a 
suitable vector. Primer extension can be mediated by the 
"polymerase chain reaction" ("PGR") , or other means* 

In the "polymerase chain reaction" or "PCR" the 
10 amplification of a specific nucleic acid sequence is 
achieved using two oligonucleotide primers complementary 
to regions of the sequence to be amplified (Figure 1) , 

The polymerase chain reaction provides a method for 
selectively increasing the concentration of a nucleic acid 
15 molecule having a particular sequence even when that 
molecule has not been previously purified and is present 
only in a single copy in a particular sample. The method 
can be used to amplify either single or double stranded 
DNA. 

20 Reviews of the polymerase chain reaction are provided by 

Mullis, K.B. (Cold Spring Harbor Svmp. Quant. Biol, 

11:263-273 (1986) ) ; Saiki, R.K. , et al. f Bio/Technolocrv 
3:1008-1012 (1985)); Mullis, K.B. . et al. ( Met. Enzvmol. 
155:335-350 (1987); Krlich H. et al. . (EP 50,424; EP 

25 84,796, EP 258,017,. EP 237,362); Mullis, K. (EP 201,184); 

Muliis K. et al. . (US 4,683,202); Erlich, H. (US 
4,582,788); and Saiki, R. et al. (US 4,683,194) all of 
which references are incorporated herein by reference) . 

The ability to incorporate a gene sequence into a 

3 0 suitable vector is typically performed using restriction 
endonucleases * Thus, the vector and the desired gene 
sequence are treated with a restriction nuclease capable 
of producing compatible termini which can then be ligated 
together to form a covalently closed vector molecule. 

3 5 Preferably, the restriction enzyme is selected such that 
its recognition site is not present in the desired gene 
sequence. 
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It would be desirable to be able to generally alter 
the nucleotide sequences of a desired target sequence in 
order to permit it to be cloned into a suitable vector 
without using oligonucleotide linkers/ adapters , and 
a regardless of the availability or suitability of 
restriction sites. The present invention provides methods 
suitable for accomplishing these goals, 

SUMHjOtY OF THE INVENTION ; 

The present invention provides improved methods for 
10 manipulating recombinant DNA in gene cloning and 
expression. More specifically, the invention provides 
methods capable of altering a nucleic acid sequence 
present at the termini of a target sequence. 

In detail, the invention provides a method for 
15 incorporating a double-stranded linear desired nucleic 
acid molecule into a double-stranded vector, comprising: 

(A) forming a modified desired nucleic acid molecule 
characterized in possessing a first region of pre-selected 
sequence at at least one terminus of a first strand, the 

2 0 sequence containing at least one dU residue; 

(B) treating the first region of pre-selected 
sequence under conditions sufficient to result in the 
removal of the uracil base of at least one of the dU 
residues, to thereby form a protruding terminus capable of 

25 hydrogen bonding to a complementary seguenceboth of the 
strands, on at least one strand of the modified desired 
molecule ; 

(C) incubating the modified molecule (B) in the 
presence of a modified vector having at least one 

30 protruding single-stranded terminus, and being capable of 
hydrogen bonding to at least one of the protruding 
terminus of the modified desired DNA molecule, to thereby 
incorporate the double-stranded linear desired nucleic 
acid molecule into the double-stranded vector. 
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The invention also provides the embodiment of the 
above method wherein only one terminus of the modified 
desired molecule contains the dU-containing sequence. 

The invention also provides the embodiments of the 
5 above methods wherein the terminus is a 3 ' terminus, or a 
5* terminus of the modified desired molecule. 

The invention also provides the embodiments of the 
above methods wherein two termini of the modified desired 
molecule contain the dU-containing sequence. The 
10 invention also provides the embodiments of the above 
method wherein both of the termini are 3 * termini or both 
of the termini are 5' termini of the modified desired 
molecule. 

The invention also provides the embodiments of the 
15 above methods wherein the termini of the first and second 
strands of the desired DNA molecule contain a plurality of 
dU residues. 

The invention also provides the embodiments of the 
above methods wherein in step (B) , the dU residues are 

20 treated with UDG under conditions sufficient to remove the 
uracil base of at least one of the dU residues, to therby 
form an abasic site, or wherein in step (B) additionally 
comprises treating the abasic site with Endonuclease IV 
under conditions sufficient to cleave the modified desired 

25 molecule at the abasic site. 

The invention also provides the embodiments of the 
above methods wherein the regions of pre-selected sequence 
of the modified desired DNA molecule are identical. 

The invention also provides the embodiments of the 

30 above methods wherein in step (C) , the two protruding 
single-stranded termini are produced through the action of 
a restriction endonuclease, or through the ligation of an 
oligonucleotide to the vector or by (I) adding to the 
vector: 

35 (i) a first region of pre-selected sequence at a 5' 

terminus of a first strand, the sequence containing at 
least one dU residue; 
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(ii) a second region of pre-selected sequence at a 5 * 
terminus of a second strand, the sequence containing at 
least one du residue; and 

(II) treating the first and second regions of pre- 
5 selected sequence under conditions sufficient to result in 
the rexQoval of the uracil base of at least one of the dU 
residues, to thereby form the modified vector having the 
protruding 3' termini. 

The invention further provides a circular nucleic 
10 acid molecule comprising: 

(A) a double-stranded linear or linearized vector 
molecule having two termini, A and B, each having a region 
of pre-selected sequence, and 

(B) a double-stranded desired nucleic acid molecule 
15 having two termini, I and II, each having a region of pre- 
selected sequence, 

wherein the region of pre-selected sectuence of a 
first strand of the vector molecule at termini A and the 
region of pre-selected sequence of a second strand of the 
20 desired nucleic acid molecule at termini I are hybridized 
to one another; and 

wherein the region of pre-selected sequence of a 
second strand of the vector molecule at termini B and the 
region of pre-selected sequence of a first strand of the 

2 5 desired nucleic acid molecule at termini II are hybridized 

to one another. 

The invention further provides a kit spepially 
adapted to contain in close compartmentalization a first 
container containing a double-stranded oligonucleotide, 

3 0 having at least one dU nucleotide at a terminus, of one 

strand, and a second container containing an -enzyme 
capable of removing a uracil base of the dU residue* 

The invention also provides the embodiments of the 
above kit which additionally contains a third container 
35 containing a linearized double-stranded vector having at 
least one protruding terminus, the terminus having a 
sequence which is substantially similar to the nucleotide 
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sequence of the dU-containing strand of the 
oligonucleotide , 

The invention also provides the embodiments of the 
above kit wherein the terminus is a 3 • or a 5 • terminus, 
or wherein two termini of the modified desired molecule 
contain the dU-containing sequence. 



BRIEF DESCRTPTION OF THE FIGURES 



Figure l describes the use of two oligonucleotides 
complementary to regions of the sequence to be amplified 
10 in a PCH amplification process. 

Figure 2 describes an embodiment wherein the exo- 
sample nucleotide is incorporated into one strand of a 
double-stranded oligonucleotide. The target molecule is 
depicted in Figure 2A, Figure 2B illustrates the 
15 modification of the desired molecule so as to result in 
the alteration of the terminus of the molecule. Figure 2C 
shows the production of a protruding 3' terminus. 

Figure 3 shows an embodiment wherein • the exo- 
nucleotide is incorporated into both strands of a double- 
20 stranded molecule, and used to produce a molecule having 
two modified termini. 

Figure 4A describes an embodiment wherein the exo- 
sample nucleotide is incorporated into one strand of a 
double-stranded oligonucleotide to modify the 5 • terminus 
25 of a molecule. The target molecule is depicted in Figure 
2A, Figure 4B illustrates the removal of the du-- 
containing sequence. 

Figure 5 shows an embodiment wherein the exo- 
nucleotide is incorporated into both strands of a double- 
3 0 stranded molecule, and used to produce a molecule having 
two modified termini. 

Figure 6 shows a depiction of a primer. 
Figure 7 shows the structure that is formed by 
hybridization between the primer and the target sequence 
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by virtue of the homology between the sequences of the 3 • 
hybridizing region and the target molecule. 

Figure 8A shows a depiction of a dU-containing 
primer. Figure SB shows a depiction of an embodiment 
wherein the entire primer contains exonucleotides. Figure 
8C shows a depiction of the hybridized structure formed 
between the target molecule and the primer. 

Figure 9 shows a depiction of an-exonucleotide- 
containing molecule that is in a form which can readily be 
readily inserted into a plasmid or other vector. 

Figure 10 shows the use of linkers to produce a 
linearized vector having protruding 3' termini. 

Figure 11 shows the use of pcr and exo-sample 
nucleotides to produce a linearized vector having 
15 protruding 3» termini. 

Figure 12 shows the structures resulting from the 
removal of the dU residues from modified molecules. 

Figure 13 shows the use of the disclosed method to 
form a circular vector molecule. Figure 13A shows the 
20 modified molecules after destruction of exo-sample 
nucleotide. Figure 13B illustrates the loss of base 
pairing capacity of the region of pre-seleoted sequence 
after the destruction of the exo-sample nucleotide. 
Figure 13 c illustrates the formation of the circular 
25 vector nolecule containing the modified desired sequence. 
In Figures 13A and 13B the upper depiction illustrates the 
structure of the modified desired molecule, and the. lower 
depiction illustrates the structure of the modified vector 
molecule . 



PBSCRIPTION OF THK PREPCTiwign EWBonTWTUT«- 

1. TERMS USED IN MOX^COUUR BIOLOGY 

In the description that follows, a niunber of terms 
used in molecular biology and nucleic acid amplification 
technology are extensively utilized. In order to provide 
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a clearer and consistent understanding of the 
specification and claims, including the scope to be given 
such terms, the following definitions are provided. 

"Amplification", as used herein, refers to any ia 
2iitro process for increasing the number of copies of a 
nucleotide sequence or sequences. Nucleic acid 

amplification results in the incorporation of nucleotides 
into DNA or RNA. PGR is an example of a suitalbe method 
for DNA amplification. As used herein, one amplification 
reaction may consist of many rounds of dna replication. 
For example, one PGR reaction may consist of 10-50 
"cycles" of denaturation and replication. 

"Nucleotide" as used herein, is a term of art that 
refers to a base-sugar-phosphate combination. Nucleotides 
are the monomeric units of nucleic acid polymers, i.e. of 
DNA and ENA. The term includes ribonucleoside 
triphosphates, such as rATP, rCTP, rGTP, or rUTP, and 
deoxyribonucleoside triphosphates, such as dATP, dCTP, 
dGTP, or dTTP. A "nucleoside" is a base-sugar 
20 combination, i.e. a nucleotide lacking phosphate. 

"Exo-sample nucleotide", as used herein, refers to a 
nucleotide which is generally not found in a sequence of 
DNA. For most DNA samples, deoxyuridine is an example of 
an exo-sample nucleotide. Although the triphosphate form 
25 of deoxyuridine, dUTP, is present in living organisms as 
a metabolic intermediate, it is rarely incorporated into 
DNA. When dUTP is incorporated into DNA, the resulting 
deoxyuridine is promptly removed in vivo by normal 
processes, e.g. processes involving the enzyme uracil DNA 
30 glycosylase (udg) (Kunkel, U.S. 4,873,192; Duncan, b.k. , 
The Engymgp 2iy:565-586 (1981), both references herein 
incorporated by reference in their entirety). Thus, 
deoxyuridine occurs rarely or never in natural DNA. it is 
recognized that some organisms may naturally incorporate 
35 deoxyuridine into DNA. For nucleic acid samples of those 
organisms, deoxyuridine would not be considered an exo- 
sample nucleotide. Examples of other exo-sample 
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nucleotides include bromodeoxyuridine, 7-inethylguanine, 
5 , 6-dihyro-5 , 6 d ihydroxydeoxy thymidine , 3- 
methyldeoxadenosine, etc. (see, Duncan, B.K., The Enzv^n^^ 
213^:565-586 (1981)). other exo-saiaple nucleotides will be 
evident to those in the art. For example, RNA primers 
used for DNA amplifications can be readily destroyed by 
alkali or an appropriate ribonuclease (RNase) . RNase H 
degrades rna of RNA: DNA hybrids and numerous single- 
stranded RNases are known which are useful to digest 
single-stranded RNA after a denaturation step. 

The presence of deoxyuridine , or any other exo-sample 
nucleotide, may be readily determined using methods well 
known to the art. A nucleic acid molecule containing any 
such exo-sample nucleotide is functionally equivalent to 
DNA containing only dA, dC, dG or dT (dT is referred to 
herein as T) in all respects, except that it is uniquely 
susceptible to certain treatments, such as glycosylase 
digestion. Numerous DNA glycosylases are known to the 
art. An exo-sample nucleotide which may be chemically or 
enzymatically incorporated into an oligonucleotide and a 
DNA glycosylase that acts on it may be used in this 
invention. DNA containing bromodeoxyuridine as the exo- 
sample nucleotide may be degraded by exposure to light 
under well-known conditions. 

The use of exoTsample nucleotides to remove potential 
contaminants from samples being subjected to pcr 
amplification is disclosed by Longo, M.c. et al . (geng 
22:125-128 (1990), Hartley, U.S. Patent No. 5, 035, 966) , 
herein incorporated by reference in their entirety. This 
3 0 reference discloses the use of either dU-containing 
oligonucleotides or dUTP in the PCR-direoted amplification 
of a target sequence. 

The "desired" or "target" gene sequence or nucleic 
acid molecule is the term used to designate the sequence 
which is to be either amplified, or incorporated into a 
vector (which may be circular or linear), in order to 
achieve the objectives of the present invention. The 
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sequence may be of any size or complexity. in general, 
some inforittation is known about the desired sequence, such 
that the sequences of its termini can be ascertained. Any 
molecule which can be amplified by pgr, or which has 
restriction sites at its tennini can be used as the 
desired or target sequences of the present invention, A 
"chimeric molecule is a vector (plasmid, cosmid, viral 
nucleic acid, etc) which has been modified to carry or 
contain the desired gene sequence. 

Two sequences are said to be "substantially similar 
in sequence" if they are both able to hybridize to the 
same oligonucleotide. 

The "terminus" of a nucleic acid molecule denotes a 
region at the end of the molecule. The term is not used 
herein as representing the final nucleotide of a linear 
molecule, but rather a general region which is at or near 
an end of a linear or circular molecule. 

Two termini of two nucleic acid molecules are said to 
be the "same denominated termini," if the both termini are 
either the 3' termini of the respective molecules or both 
termini are the respective 5 ' termini of the respective 
molecules. As used herein, the term "same denominated 
termini," is not intended to refer to the nucleotide 
sequence of the termini being compared. 
2 5 As used herein, a DNA molecule is said to be 

"circular" if it is capable of depiction as either a 
covalentiy closed circle, or as a hydrogen bonded circle, 
A circular molecule may thus be composed of one or more 
polynucleotides bonded to one another via covalent or* 
30 hydrogen bonds. The terminal nucleotide (s) of each 
polynucleotide may either be single-stranded, or may be 
bonded to another polynucleotide via covalent or hydrogen 
bonds . 

"Uracil DNA glycosylase" (UDG) , a term of art, refers 
35 to an activity which cleaves the glycosidic bond between 
the base uracil and the sugar deoxyribose, only when the 
monomer ic nucleotide dUTP is incorporated into a DNA 
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molecule, resulting in incorporation of a deoxyuridine 
moiety (Duncan, B. in The Enzvmes 14:565 (1981), ed* : 
Boyer P) . An enzyme possessing this activity does not act 
upon free dUTP, free deoxyuridine, or RNA (Duncan, supra ) . 
5 The action of UDG results in the production of an "abasic" 
site. The enzyme does not, however, cleave the 
phophodiester backbone of the nucleic acid molecule. Most 
preferably, the phophodiester backbone at an abasic site 
may be claeved through the use of an endonuclease specific 
10 for such substrates. A preferred enzyme for this purpose 
is the E. coli enzyme, Endonuclease IV. Most preferably, 
Endonuclease IV is used in conjunction with UDG to remove 
dU residues from a nucleic acid molecule. 

"Incorporating" as used herein, means becoming part 
15 of a nucleic acid polymer. 

"Terminating" as used herein, means causing a 
treatment to stop. The term includes means for both 
permanent and conditional stoppages. For example, if the 
treatment is enzymatic, a permanent stoppage would be heat 
20 denaturation; a conditional stoppage would be, for 
example, use of a temperature outside the enzyme's active 
range. Both types of termination are intended to fall 
within the scope of this term. 

"Oligonucleotide" as used herein refers collectively 
25 and interchangeably, to two terms of art, "oligonucleotide" 
and "polynucleotide". Note that although oligonucleotide 
and polynucleotide are distinct terms of art, there, is no 
exact dividing line between them and they are used 
interchangeably herein. An oligonucleotide is said to be 
3 0 either an adapter, adapter/ linker or installation 
oligonucleotide (the terms are synonymous) if it is 
capable of installing a desired sequence onto a 
predetermined oligonucleotide. An oligonucleotide may 
serve as a primer unless it is "blocked.". An 
35 oligonucleotide is said to be "blocked," if its 3» 
terminus is incapable of serving as a primer. 
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"Oligonucleotide-dependent amplification" as used 
herein refers to amplification using an oligonucleotide or 
polynucleotide to amplify a nucleic acid sequence* An 
oligonucleotide-dependent amplification is any 
5 amplification that requires the presence of one or more 
oligonucleotides or polynucleotides that are two or more 
mononucleotide subunits in length and that end up as part 
of the newly-formed, amplified nucleic acid molecule, 

"Primer" as used herein refers to a single-stranded 
10 oligonucleotide or a single-stranded polynucleotide that 
is extended by covalent addition of nucleotide monomers 
during amplification. Nucleic acid amplification often is 
based on nucleic acid synthesis by a nucleic acid 
polymerase* Many such polymerases require the presence of 
15 a primer that can be extended to initiate such nucleic 
acid synthesis* A primer is typically 11 bases or longer; 
most preferably, a primer is 17 bases or longer. A 
minimum of 3 bases may, however, suffice, 

"Reaction volume" denotes a liquid suitable for 
2 0 conducting a desired reaction (such as amplification, 
hybridization, cDNA synthesis, etc) • 

A "ligase" is an enzyme that is capable of joining 
the 3' hydroxyl terminus of one nucleic acid molecule to 
a 5* phosphate terminus of a second nucleic acid molecule 

2 5 to form a single molecule. Ligase enzymes are discussed 

in Watson, J.D., In; Mol ecular Bioloov of the Gen^ , 3rd 
Ed*, W.A. Benjamin, Inc., Menlo Park, CA (1977), and 
similar texts. 

When an enzymatic reaction, such as a ligation or a 

3 0 polymerization reaction, is being conducted, it is 

preferable to provide the components required for such 
reaction in "excess" in the reaction vessel • "Excess" in 
reference to components of the amplification reaction 
refers to an amount of each component such that the 
35 ability to achieve the desired amplification is not 
limited by the concentration of that component. when 
linker/ adapter molecules are used after ligation, the 
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excess linker/ adapter present in the reaction is 
preferably separated from the reaction products, or 
removed from the reaction Taix, so that they will not 
compete with the cloning of the desired sequence. Use of 
5 linker/adapter oligonucleotides containing dU residues 
allows one to destroy excess linker /adapter molecules by 
enzymatic degradation or other means. 

The methods of the present invention are explained 
partly through illustration • In these illustrations , 
10 sequence pairs, such as A and A*, B and B», C and C*, 
X and X*, and Y and y», respectively, etc, are 
complementary to each other. Complementation need not be 
exact; homology sufficient for proper functioning, e.g. 
annealing and priming, will suffice. 

15 XX, THE METHODS AND HOXiKCOItES OF THE PRESENT INVENTION 

The present invention employs exo-sample nucleotides, 
most preferably the nucleotide dUTP (which, when incorpor-- 
ated into a nucleotide sequence is designated as dU) to 
create a 3 ' or 5 ' overhanging extension in the target 
20 nucleic acid molecules. The nucleic acid molecules can be 
derived from PCR, or other methods, or can be isolated 
directly from suitable source materials, 

A. MODIFICATION OF EITHER THE 3« OR 5» TERMINI OF 
A DESIRED NDCIiEIC ACID MOIiECOliE USING 
25 LINKER/ADAPTER MOLECOUES 



The present invention permits one to modify either 
the 3» or 5» termini of a desired nucleic acid molecule so 
as to create either a 5' or 3 ' single-stranded overhanging 
region. The invention accomplishes this goal through the 
3 0 use of exo-sample nucleotides, preferably dU, In a first 
embodiment, the exo-sample nucleotide is incorporated into 
one strand of a double-stranded oligonucleotide. This 
oligonucleotide is then ligated to a terminus or to both 
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termini of the desired molecule. Thus, if the target 
molecule is depicted as shovm in Figure 2A, then to modify 
the desired molecule so as to produce a protruding 3' 
terminus, an exo-sample nucleotide is ligated to that 
terminus (Figure 2B) . Treatment with UDG results in the 
removal of the uracil base of the dU residues, thereby 
producing abasic sites. The abasic sites can be cleaved 
with Endonuclease IV, or similar enzymatic activities, to 
produce the desired modified molecule (Figure 2C) . 

As will be readily recognized, it is possible to 
modify both termini of the desired molecule through the 
ligation of the dU-containing oligonucleotide to both ends 
of the molecule (Figure 3) . 

In order to modify the desired molecule so as to 
produce a protruding 5' terminus, an exo-sample nucleotide 
is ligated to that terminus (Figure 4A) . As in the above 
embodiment, treatment with UBG results in the removal of 
the uracil base of the dU residues, thereby producing 
abasic sites, which can be cleaved with Endonuclease IV, 
or similar enzymatic activities, to produce the desired 
modified molecule (Figure 4B) , Again, as in the above 
embodiment, it is possible to modify both termini of the 
desired molecule through the ligation of the dU-containing 
oligonucleotide to both ends of the molecule (Figure 5). 

most . preferred embodiment, however, the 
present invention employs PCR to modify the 5' termini of 
the desired molecule. 



B. MODIFICATION OF THE 5* TERMINX OF A DESIRED 
3° NITCI.EIC ACID MOLECDM: USIHG PGR AMPLIFICATION 

in a second embodiment for modifying the 5' termini 
of a desired molecule, so as to permit the production of 
a molecule having overhanging 3« termini, a variation of 
PGR amplification may be used* In this embodiment, the 
35 desired or target sequences of the present invention are 
modified using PGR so as to cause them to have 5' termini 
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which contain at least one, and preferably several exo- 
nucleotide molecules. For this embodiment, a specialized 
primer is employed. The specialized primer may be added 
at any stage, either initially in the PGR reaction, or 
5 after any number of cycles of amplification* When added 
after one or more cycles of PGR, the initial cycles of 
amplification are conducted using conventional primers. 

C. THE SPECIALISED PRIMERS OF THE PRESENT INVENTION 

The modification of the termini of the desired 

10 molecule is preferably accomplished using PGR with two 
specialized primers. 

Thus, each primer will be constructed such that it 
contains a 3' hybridizing region which is complimentary to 
a 5» region of one strand of a desired DNA molecule. The 

15 primers will also contain a region of predetermined and 
pre-selected sequence (whose length is in general of the 
same order of magnitude as the 3 » hybridizing region of 
the molecule) * Most preferably, the region of pre- 
selected sequence will be approximately 10-20, and most 

20 preferably approximately 12, bases in length- There are 
no constraints with regard to the nucleotide sequence of 
the pre-selected region of the primer molecule. The 
sequence can be either repetitive, palindromic, or unique. 
Each of the primers may thus be depicted as shown in 

25 Figure 6. 

As indicated, the primers will be capable of 
hybridizing to one strand of the target sequence by virtue 
of the homology between the sequences of the 3 ' 
hybridizing region and the target molecule (Figure 7). 

3 0 The 3' hybridizing region of the primer molecules 

need not be complementary to the precise termini of the 
target molecule. Indeed, the target molecule need not be 
a linear molecule. The purposes of the present invention 
will be achieved if the 3 » hybridizing region of the 

35 primer molecules is capable of hybridizing to a region 
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which either contains or flanks the sequence which is to 
he cloned. 

A characteristic of the pre-selected sequence is that 
it will contain a number of exo-sample nucleotides, which 
5 will be preferably interspersed throughout. In a 
preferred embodiment, the pre-selected sequence is 12 
bases long and every third base is a dU. The primer may 
thus be depicted as shown in Figure 8A. 

The 3» hybridizing region of the primer may also 
10 contain one or more exo-sample nucleotides. Indeed, in 
one embodiment of the invention the entire primer contains 
exo-sample nucleotides dispersed throughout (Figure 8B) , 
The hybridized structure formed between the target 
molecule and the primer may thus be depicted as shown in 
15 Figure sc. 

The primer or primers are thus incubated in the 
presence of a sample which contains, or is suspected of 
containing the desired nucleic acid molecule. PCR, or an 
alternative method is then carried out in the manners 
20 described above. After at least one amplification cycle, 
a desired molecule can be produced by permitting the 
primer-extension molecules to self hybridize. As will be 
recognized, the resultant molecule differs from the 
initial desired molecule in two respects. First, it 
25 contains at both of its termini additional sequences 
corresponding to the pre-selected sequence region. 
Second, the pre-selected sequence region at the 5» isnd of 
both of the amplified strands will contain the exo-sample 
nucleotides of the primer molecule (s). This molecule is 
in a form which can be readily inserted into a plasmid or 
other vector in accordance with the methods of the 
invention. The molecule may be depicted as follows shown 
in Figure 9. 
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D. THE VECTOR 

Any procaryotic or eukaryotic plasmid or viral vector 
can toe modified to permit its use in accordance with the 
methods of the present invention. Where the vector is a 
5 circular molecule, it is first linearized using, for 
example, a restriction endonuclease . The two termini of 
the linearized molecule are then altered to contain a 
sequence complimentary to one or tooth of the pre-selected 
sequences present in the amplified desired molecule* As 
10 in the case of the atoove-described primers, the termini of 
the linearized vector molecule can toe altered by any of a 
variety of methods. 

In a preferred embodiment, linkers can be used and 
ligated to the ends of the linearized molecule to produce 
15 the desired proruding 3' termini (Figure 10), 

Alternatively, PGR can be used to produce linearized 
vector molecules having the suitable termini. The termini 
of the vector are modified to contain an exo-sample 
nucleotide-containing sequence which is complementary to 
20 the exo-sample nucleotide-containing sequence of the 
modified desired molecule. Thus, where the exo-sample 
nucleotide-containing sequence of the modified desired 
molecule has the sequence "X" the exo-sample nucleotide- 
containing sequence of the modified vector shall have the 
25 sequence "X*" (Figure 11). 

As will be noted, the effect of these manipulations 
is to produce a linearized molecule having termini which 
contain a pre-selected sequence which is complimentary to 
the pre-selected sequence contained in either or both of 
30 the termini of the modified desired sequence. 
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E. THE CLONING OF THE MODIFIED DESIRED SEQUENCE 
INTO THE MODIFIED VECTOR 



The vector molecule and the desired molecule, as 
modified in the manner described above, are incubated 
5 under conditions which cause the destruction of the exo- 
sample nucleotide. Thus, for example, where the exo- 
sample nucleotide is dU, the molecules are subjected to 
treatment with the enzyme UDG. The resulting structures 
are shown below (Figure 12), As illustrated, treatment 
10 with UDG does not result in the scission of the 
phosphodiester backbone of the nucleic acid molecules. 
Rather, it results in the production of abasic sites which 
are thus incapable of base-pairing with complimentary 
sequences. The presence or absence of these abasic site- 
15 containing sequences is irrelevant to the subsequent 
application of the methods of the invention. 

Due to the complimentarity of the modified vector and 
desired molecule sequences, continued incubation of the 
modified vector with the modified desired sequences, after 
20 destruction of the exo-sample nucleotide, permits a 
chimeric molecule to form (Figure X3) . 

As will be appreciated, the use of sequences which 
are complimentary to only one end of the modified desired 
molecule permits one to insert the sequence in a uni- 
25 directional manner- Thus, for example, if one end of the 
modified desired molecule had a region of pre-selected 
sequence X and X*, and the other end of the molecule had 
a region of pre-selected sequence Y and Y ' , it would be 
possible to control the directionality of the insertion 
3 0 into the modified vector by employing a vector having 
regions of pre-selected sequence X' and X, and Y' and Y, 
respectively* 

Significantly, it is not necessary to remove the 
abasic regions from the circular molecule prior to 
35 transformation into a suitable microbial host. Similarly, 
it is not necessary to treat the circular molecule with a 



wo 92/22649 W PCr/US92/04922 

DNA ligase or other agent in order to produce a double- 
stranded covalently closed circular molecule . The 
circular molecule described above can be used directly to 
transform recipient cells. 
5 The methods of the present invention are especially 

amenable for use in in vitro procedures which utilize 
enzymes to amplify specific nucleic acid sequences and 
especially to PGR. 

The present invention includes articles of 

10 manufacture, such as "kits," such kits will, typically, 
be especially adapted to contain in close 

compartmentalization an instructional brochure; and a 
circular, or more preferably, linearized vector molecule 
whose 5» termini contain a region of pre-selected sequence 

15 which contains at least one exo-sample nucleotides. In a 
second embodiment, the kit will contain a modified vector 
molecule in which the exo-sample nucleotide has been 
destroyed to produce a protruding (i.e. overhanging) 3' 
terminus, or eguivalently, a non-recessed 5» terminus 

2 0 which is incapable of base pairing with a complementary 
sequence • 

In sub-embodiments of the above embodiments, the kit 
may also contain a container containing an exo-sample 
nucleotide-containing oligonucleotide suitable for use as 
25 a primer for modifying the termini of a desired nucleic 
acid molecule and/or a container which contains an enzyme 
capable of degrading an oligonucleotide which contains the 
exo-sample. nucleotide. The kit may additionally contain 
buffers, enzymes, and the like* 

Having now generally described the invention, the 
same will be more readily understood through reference to 
the following examples which are provided by way of 
illustration, and are not intended to be limiting of the 
present invention, unless specified. 
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Enzymes and reagents. 

Tag DNA polymerase was purchased from .Perkin Elmer- 
Cetus; dNTPs were from Boehringer Mannheim. Competent 
5 bacteria (DHIOB) , proteinase K, and restriction enzymes 
were from BRL. Oligonucleotides were synthesized using an 
ABI-3 8 0A DNA synthesizer and 

Amplification of vector and human cosmid DNA. 

All PGR reactions were 50 microliters covered with 
10 mineral oil using the following final buffer 
concentration: 50 mM KCl, 10 mM Tris-HCl (pH 8,4), 1.5 mM 
MgCl^ , and 0.2 mM of each dNTP. a Perkin Elmer-Cetus 
thermal cycler was used to generate Alu-PCR products as 
well as to analyze the inserts from subclones. After an 
initial 5 minutes at 93^ 35 cycles of i minute at 60^ i 
minute at 12^ and 1 minute at 93° were used. An additional 
5 minutes at 72** was used for the last cycle. Twenty to 3 0 
ng of each of the four Notl-linearized cosmids was 
amplified; 1 ng of Xbal-linearized pucii9 (5) was 
amplified as described above. Products from PGR reactions 
were analyzed by l % agarose gel electrophoresis in TAB 
buffer with ethidium bromide. 

UDG treatment. 

The vector and Alu-PCR products were precipitated 
25 with ethanol and dissolved in the following buffer (25 mM 
Tris-HCl [pH 7.8], ID mM MgCl^ , 4 mM betamercaptoethanol, 
0.4 mM ATP). Single-stranded 3* overhangs consisting of 
10 nucleotides in the vector and 11 nucleotides in the 
Alu-PCR products were made by treating vector (225 ng) and 
Alu-PCR products (110 ng to 212 ng) each separately with 
UDG (BRIi) in a final volume of 10 microliters for 10 
minutes at 37**. Initial experiments used 16 units of UDG, 
however as little as 1 unit has been found to be 
sufficient. A 10 minute treatment at 65** was used 
3 5 following the UDG treatment. 
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Cloning and transformation » 

UDG treated vector (45 ng) was combined with UDG 
treated Alu-PCR reaction products (45 ng to 106 ng) in a 
final volume of 2 0 microliters in the above Tris, MgCl2 
5 betamercaptoethanol, ATP buffer for one hour at room 
temperature. Five microliters from each combination were 
transformed in 50 microliters of DHlOB competent cells 
(BRL) following the manufacturers recommendations , and 
plated onto LB plates containing ampicillin, X-gal and 
10 IPTG. 



PCR analysis of transf ormants . 

Subclones were analyzed by PCR using the Alu primer. 
Single white colonies were dispersed into 12 microliters 
of 10 mM Tris"HCl (pH 7,5), 1 mM EDTA, 50 micrograms per 
15 ml proteinase K and incubated at 55" for 15 minutes, 8 0" 
for 15 minutes, and chilled on ice. pgr components 
including the Alu primer were added and amplified for 30 
cycles using the above protocol. Five microliters of each 
analysis was run on an agarose gel for sizing. 
20 Analysis of the transf ormants obtained using this 
procedure showed efficient cloning of PCR products using 
the exo-sample nucleotide cloning method. Control 
reactions where the insert DNA, or the vector DNA or the 
UDG treatment had been omitted resulted in no 
25 transformants, indicating that the cloning method was 
dependent on the procedure outlined and embodied in this 
application. 
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WHAT IS CLATMm TR- 

X. A method for incorporating a double-stranded linear 
desired nucleic acid molecule into a double-stranded 
vector, comprising: 
5 (A) forming a modified desired nucleic acid molecule 

characterized in possessing a first region of pre-selected 
sequence at at least one terminus of a first strand, said 
sequence containing at least one dU residue; 

(B) treating said first region of pre-selected sequence 
10 under conditions sufficient to result in the removal of 

the uracil base of at least one of said dU residues, to 
thereby form a protruding terminus capable of hydrogen 
bonding to a complementary sequenceboth of said strands, 
on at least one strand of said modified desired molecule; 

(C) incubating said modified molecule (B) in the 
presence of a modified vector having at least one 
protruding single-stranded terminus, and being capable of 
hydrogen bonding to at least one of said protruding 
terminus of said modified desired DNA molecule, to thereby 

20 incorporate said double-stranded linear desired nucleic 
acid molecule into said double- stranded vector. 



2. The method of claim 1 wherein only one terminus of 
said modified desired molecule contains said dU-containing 
sequence. 

3 . The method of claim 2 wherein said terminus is a 3 * 
terminus of said modified desired molecule. 

4. The method of claim 2 wherein said terminus is a 5 ' 
terminus of said modified desired molecule. 
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5. The method of claim 1 wherein two termini of said 
modified desired molecule contain said dU^containing 
sequence • 

6. The method of claim 5 wherein both of said termini 
5 are 3* termini of said modified desired molecule. 

7. The method of claim 5 wherein both of said termini 
are 5» termini of said modified desired molecule* 

8. The method of claim 1 wherein said termini of said 
first and second strands of said desired DNA molecule 

10 contain a plurality of dU residues, 

9. The method of claim 1 wherein in step (B) , said dU 
residues are treated with UDG under conditions sufficient 
to remove the uracil base of at least one of said dU 
residues, to therby form an abasic site. 

15 10. The method of claim 9 wherein step (B) additionally 

comprises treating said abasic site with Endonuclease IV 
under conditions sufficient to cleave the modified desired 
molecule at said abasic site. 

20 11, The method of claim 1 wherein said regions of pre- 

selected sequence of said modified desired DNA molecule 
are identical. 

12. The method of claim 1 wherein in step (C) , said two 
protruding single-stranded termini are produced through 

25 the action of a restriction endonuclease. 

13. The method of claim 1 wherein in step (C) , said two 
protruding single-stranded termini are produced through 
the ligation of an oligonucleotide to said vector. 
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14. The method of claim i wherein in step (C) , said two 
protruding single-stranded termini are produced toy 

(I) adding to said vector: 

(i) a first region of pre-selected sequence at 
a 5' terminus of a first strand, said sequence containing 
at least one dU residue; 

(ii) a second region of pre-selected sequence 
at a 5» terminus of a second strand, said sequence 
containing at least one dU residue; and 

(II) treating said first and second regions of pre- 
selected sequence under conditions sufficient to result in 
the removal of the uracil base of at least one of said dU 
residues, to thereby form said modified vector having said 
protruding 3* termini » 

15. A circular nucleic acid molecule comprising: 

(A) a double-stranded linear or linearized vector 
molecule having two termini, A and B, each having a region 
of pre-selected sequence, and 

(B) a double-stranded desired nucleic acid molecule 
having two termini, I and II, each having a region of pre- 
selected sequence, 

wherein the region of pre-selected sequence of a first 
strand of said vector molecule at termini A and the region 
of pre-selected sequence of a second strand of said 
desired nucleic acid molecule at termini I are hybridized 
to one another; and 

wherein the region of pre-selected sequence of a second 
strand of said vector molecule at termini B and the region 
of pre-selected sequence of a first strand of said desired 
nucleic acid molecule at termini II are hybridized to one 
another ♦ 

16. A kit specially adapted to contain in close 
compartmentalization a first container containing a 
double-stranded oligonucleotide, having at least one dU 
nucleotide at a terminus of one strand, and a second 
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container containing an enzyme capable of reinoving a 
uracil base of said dU residue, 

17. The kit of claim 16 which additionally contains a 
third container containing a linearized double-stranded 
5 vector having at least one protruding terminus, said 
terminus having a sequence which is substantially similar 
to the nucleotide sequence of said dU-containing strand of 
said oligonucleotide. 

18 • The kit of claim 16 wherein said terminus is a 3 ' 
10 terminus. 



19. The Kit of claim 16 wherein said terminus is a 5 * 
terminus* 
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